Development of a prosodic reading tutor of Japanese - effective use of TTS and F0 contour modeling techniques for CALL

Authors

  • Nobuaki Minematsu
  • Hiroya Hashimoto
  • Hiroko Hirano
  • Daisuke Saito
Abstract

Text typed into a speech synthesizer is generally converted into its corresponding phoneme sequence, to which a prosody prediction module attaches various kinds of prosodic symbols. By using this module effectively, we built a prosodic reading tutor of Japanese, called Suzuki-kun, which is provided as one function of OJAD (Online Japanese Accent Dictionary) [1]. In Suzuki-kun, the prosody prediction module converts any Japanese text into its reading (a Hiragana sequence), over which a natural-sounding pitch pattern is visualized as a smooth curve drawn by the F0 contour generation process model [2]. The positions of accent nuclei and unvoiced vowels are also shown. Suzuki-kun also reads the text aloud, following the visualized prosodic features. Since the release of Suzuki-kun, the number of accesses to OJAD has increased drastically: over the last four months, OJAD received 129,168 accesses, 58.9% of which came from outside Japan.
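
The smooth pitch curve mentioned in the abstract can be illustrated with a minimal sketch of the F0 contour generation process model, commonly known as the Fujisaki model. The abstract does not give the formulation used in Suzuki-kun, so this sketch assumes the standard textbook form, in which log F0 is a baseline plus phrase components (impulse responses) and accent components (clipped step responses). The constants `ALPHA`, `BETA`, and `GAMMA` are typical literature values, and the function names and the phrase and accent commands in the example are hypothetical.

```python
import math

# Typical literature values; assumptions, not taken from the paper.
ALPHA = 3.0   # natural angular frequency of the phrase control mechanism (1/s)
BETA = 20.0   # natural angular frequency of the accent control mechanism (1/s)
GAMMA = 0.9   # ceiling of the accent component


def phrase_response(t, alpha=ALPHA):
    """Impulse response of the phrase control mechanism (zero before onset)."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)


def accent_response(t, beta=BETA, gamma=GAMMA):
    """Step response of the accent control mechanism, clipped at gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def f0_contour(times, base_hz, phrases, accents):
    """Return F0 in Hz at each time point.

    phrases: list of (onset_time, magnitude) phrase commands
    accents: list of (onset_time, offset_time, amplitude) accent commands
    """
    contour = []
    for t in times:
        log_f0 = math.log(base_hz)
        for t0, magnitude in phrases:
            log_f0 += magnitude * phrase_response(t - t0)
        for t1, t2, amplitude in accents:
            log_f0 += amplitude * (accent_response(t - t1) - accent_response(t - t2))
        contour.append(math.exp(log_f0))
    return contour


# Hypothetical commands for a one-second utterance sampled every 10 ms.
times = [i * 0.01 for i in range(101)]
contour = f0_contour(
    times,
    base_hz=120.0,
    phrases=[(0.0, 0.5)],
    accents=[(0.2, 0.5, 0.4)],
)
```

Plotting `contour` against `times` gives a smooth rise and fall around the accent command. In a tutor like the one described, such commands would presumably be derived from the predicted prosodic symbols rather than set by hand.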

Similar resources

Prosodic Reading Tutor of Japanese, Suzuki-kun: The first and only educational tool to teach the formal Japanese

Text typed into a speech synthesizer is generally converted into its corresponding phoneme sequence, to which a prosody prediction module attaches various kinds of prosodic symbols. By using this module effectively, we built a prosodic reading tutor of Japanese, called Suzuki-kun, which is provided as one feature of OJAD (Online Japanese Accent Dictionary) [1, 2]. In Suzuki-kun, any Japa...

Design and Development of a Prosody Generator for Arabic TTS Systems

Prosody modeling has become the backbone of TTS synthesis systems. Among all the prosodic modeling approaches, phonetic methods that aim to predict duration and F0 contour are highly regarded, thanks to the development of regression tools such as neural networks (NN). In addition, parametric representations such as the Fujisaki model for F0 contour generation help to reduce the problem into the approx...

Puretalk: a high quality Japanese text-to-speech system

This paper describes a high-quality Japanese text-to-speech (TTS) system, PureTalk. The system is similar to conventional diphone-based TTS using PSOLA, except that PureTalk employs the following novel techniques, which enable it to produce more intelligible and natural-sounding speech: 1) two-stage duration modeling based on a linear regression technique, 2) F0 contour modeling using polynomia...

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

OJAD: Web-based Prosodic Reading Tutor of Japanese

Learning prosodic control for speaking Japanese is effective in reducing syntactic and lexical ambiguity and in improving the comprehensibility of learners’ spoken Japanese. Good prosody can also improve its naturalness. In the conventional curriculum, however, prosody training has not been provided satisfactorily for learners, partly because of the scarcity of teaching materials. To facilitate prosody t...

Publication date: 2015